Picture for Ran Xu

Ran Xu

RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains

Add code
May 27, 2026
Viaarxiv icon

AutoResearch AI: Towards AI-Powered Research Automation for Scientific Discovery

Add code
May 22, 2026
Viaarxiv icon

Towards Camera-Robust 3D Localization: Equation-Anchored Tool-Use for MLLMs

Add code
May 19, 2026
Viaarxiv icon

VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation

Add code
Apr 23, 2026
Viaarxiv icon

MTA-Agent: An Open Recipe for Multimodal Deep Search Agents

Add code
Apr 07, 2026
Viaarxiv icon

How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical Generative Reasoning

Add code
Mar 25, 2026
Viaarxiv icon

Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

Add code
Feb 02, 2026
Viaarxiv icon

Future Optical Flow Prediction Improves Robot Control & Video Generation

Add code
Jan 15, 2026
Viaarxiv icon

Robotic VLA Benefits from Joint Learning with Motion Image Diffusion

Add code
Dec 19, 2025
Viaarxiv icon

Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning

Add code
Oct 27, 2025
Figure 1 for Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
Figure 2 for Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
Figure 3 for Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
Figure 4 for Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
Viaarxiv icon